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. . 1 TGGAGGAGCC AGCGGAAGGA CGGTGTGCGG GCCGGCCAGC CCTGGACGAA ' . 

51 AGAAGAGGGC CCCTCCAGGC CAGTCTGGGC ACCCTGGGAT AGCGGCTGCA 
101 GCCAGGCATG GCCGACTCTG CACAGGCCCA GAAGCTGGTG TACCTGGTCA . 
151 CAGGGGGCTG TGGCTTCCTG GGAGAGCACG TGGTGCGAAT GCTGCTGCAG 
.201 CGGGAGCCCC GGCTCGGGGA GCTGCGGGTC TTTGACCAAC ACCTGGGTCC 
251 CTGGCTGGAG GAGCTGAAGA CAGGTACCCG GAACGTGATC GAGGCTTGTG 
301 TGCAGACCGG AACACGGTTC CTGGTCTACA CCAGCAGCAT GGAAGTTGTG 
351 GGGCCTAACA CGAAAGGTCA CCCCTTCTAC AGGGGCAACG AAGACACCCC 
. 401 ATACGAAGCA GTGCACAGGC ACCCCTATCC TTGCAGGAAG GCCCTGGCCG 
451 AGTGGGTGGT CCTGGAGGCC AACGGGAGGA AGGTCCGTGG GGGGCTGCCC 
501 CTGGTGACGT GTGCCCTTCG TCCCACGGGC ATCTACGGTG AAGGCCACCA 
,551 GATCATGAGG GACTTCTACC GCCAGGGCCT GCGCCTGGGA GGTTGGCTCT 
.601 TCCGGGCCAT CCCGGCCTCT GTGGAGCATG GCCGGGTCTA TGTGGGCAAT 
651 GTTGCCTGGA TGCACGTGCT GGCAGCCCGG GAGCTGGAGC AGCGGGCAGC 
701 CGTGATGGGC GGCCAGGTAT ACTTCTGCTA CGATGGATCA CCCTACAGGA : . . 

751 GCTACGAGGA TTTCAACATG GAGTTCCTGG GCCCCTGCGG ACTGCGGCTG " . 
801 GTGGGCGCCC GCCCATTGCT GCCCTACTGG CTGCTGGTGT TCCTGGCTGC 
851 CCTCAATGCC CTGCTGCAGT GGCTGCTGGG GCCACTGGTG CTCTACGCAG . 
901 CCCTGCTGAA CCCCTACACG CTGGCCGTGG CCAACACCAC CTTCACCGTC ' 
951 AGCACCGACA AGGGTCAGCG CCATTTCGGC TATGAGCCCC TGTTCTCGTG 
1001 GGAGGATAGC CGGACCCGCA CCATTCTCTG GGTAGAGGCC GCTACGGGTT 
1051 CAGCCCAGTG ACGGTGGGGC TGGGGCGTGG . AGGCCCAGAT ACAGCACATC 
1101 CACCCAGGTC CCGAGCCCTC ACACCCTGGA CGGGAAGGGA GAGGTGCATT ' 
1151 CCAGAGCAGG AGGCAGGGCT CTGGGGGCAG AATGGCTGTC CTTGTCGTAG 
1201. AGCCCTCCAC ATTTTCTTTT TCTTTTTTGA GACAGGGTCT TGCTCTGTCA 
1251 CCCAGACTGG AGTGCAGTGG TGTGATCATA GCTCACTGCA CCCTCAACCT 
,1301 CCTGGGTTCA AGCAATCCTC CTGCCTCAGC CTCCTGAACA GCTGGGACCA 
1351 CAGGTG CACG CCACCATACC TGGCTTTTTT TTGTTGCTTT TAGAGACAGG 
1401 GTCTCACTAT ATTGCTCAAG GCTGGACTTG AACTCCTGGG CTCAAGTGAT • 
1451 CTTCCCACGT GGGCCTCCCA AAACGCTGGA ACTACAAGTG TGAGCCACCG 
1501 CGCCTGGCCC, ACCGCCTCTC CACATTTTCA ATCCAGGAGC CTTGAGTCTG 
1551 TGGCTGTGTC CTGACACCTC CAGAGTTCTG AGGGCCGTCA GGACACGGGA 
1601 GGGTTTGGGG ACAGAGTGTC CTTCCTCTGT CCTATCATCA CCAGTGCTGA 
1651 TGGCCGCTTG GTGAGTGTCT GGTGCCCTGG TGGCTTGCCC CAGCTCTCTT 
1701 GTGGCTTTCT GAGCAGGAAG CGAGCACTAG GCTCCACAGG CTTACGCTGT 
1751 GTCTCCTGCC AGCCACACAG CGACCCATCG GTGCAGAGTG . CAGACGCGGG 
1801 TGTGGTTCCT CCAGCCCACC TCAGTCCCTC TTTGGGAGGT. GATGTTCCCA 
1851 TTGTTTTTCA AAGGGGTCAC CTTCAACTGT TCTGTTTTAG AATTCCCCTC 
1901 TGGAGGGCTA TGGCCTCCCT ATGGTTTCAC TTCCGACCTA CTTCTACCTA 
1951 AGTTCCTTCC CAGCACATCG CCAGCCCTGG GCCTGGGGAT GTCCCCAATG 
2001 CTGTACCTGG CTGACCCCGG ATTAAAAGCC TCATCCACGA AAAAAAAAAA 

2051 AAAAAAAAAA AAAAAAAAAA A ... 
(SEQ ID NO: 1) 

FEATURES: 

5 ? UTR : 1-107 

Start Codon: 108 

Stop Codon: 1059 

3 'UTR: 1062 

HOMOLOGOUS PROTEINS: 

Top 10 BLAST Hits: 

CRA|335001098696094 /al tid^gi | 11545403 /def =gb | AAG37824 . 1 | AF277 . . . 638 .0.0 

CRA j 335001098696092 /altid^gi | 11545401 /def =gb | AAG37823 . 1 j AF277 . . . 562 e-159 

CRA|18000005106837 /altid=gi | 2563999 /def =dbj | BAA22931 . 1 | (AB00... 484 e-136 

CRA| 18000005043125 /altid=gi | 9629084 /def =ref | NP_044103 . 1 | MG15 . . . 269 3e-71 

CRA| 89000000192042 /altid=gi | 9634716 /def =ref | ^rP_039008 . 1 | ORF ... 257 2e-67 

CRA| 18000004899504 /altid=gi | 540666 /def =pir j j| S41971 3beta-hydr. . . 240 le-62 

CRA| 335001098644340 /altid=gi | 11251676 /def =pir | | T37430 hydroyx. . . 213 , 2e-54 

CRA| 18000004942649 /altid=g'i | 112779 /def =sp | P26670 | 3BHS_VACCV 3... 212 4e-54 
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CRA| 18000004942648 /aitid=gi | 9791111 /def =ref |NP_063838 . 1 | A44L . . . 211 9e-54 
CRA| 73000005493670 /altid=gi j 9634564 /def =ref | NP_038102 . 1 | TA55 . . , 211 9e-54 
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EST: 



gi | 11283574 /dataset=dbest /taxon=96 . . . 
gi|ll643588 /dataset=dbest /taxon=96. 
gijl3134586 /dataset=dbest /taxon=960 . . .- 
gi I 9334685 /dataset=dbest /taxon=960 . . . 

EXPRESSION INFORMATION FOR MODULATORY USE: 

gi | 11283574 brain 
gi | 11643588 kidney 
gi 1 13134586 colon 
gi | 9334685 uterus 
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1 MADSAQAQKL VYLVTGGCGF LGEHWRMLL QREPRLGELR VFDQHLGPWL 
51 EELKTGTRNV I EACVQTGTR FLVYTSSMEV VGPNTKGHPF YRGNEDTPYE 
101 AVHRHPYPCS KALAEWLVLE ANGRKVRGGL PLVTCALRPT GIYGEGHQIM 
151 RDFYRQGLRL GGWLFRAIPA SVEHGRVYVG NVAWMHVLAA RELEQRAALM 
201 GGQVYFCYDG SPYRSYEDFN MEFLGPCGLR LVGARPLLPY .WLLVFLAALN 
251 ALLQWLLRPL VLYAPLLNPY TLAVANTTFT. VSTDKAQRHF GYEPLFSWED 
301 SRTRTILWVQ AATGSAQ , 
(SEQ ID NO: 2) 



FEATURES : 

Functional domains and key regions: 

[1] PDOC0000.1 PS00001 ASN__GLYCOSYLATION • 
N-glycosylation site 

276-279 NTTF 

[2] PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 

283-285 TDK 

[3] PDOG00006 VS00006 CK2_PHOSPHO_SITE 
Casein kinase II. phosphorylation site 

Number of matches : 4 

1 76-79 SSME 

2 97-10.0 TPYE 

3 215-218 SYED / 

4 297-300 SWED 

[4] PDOC00007. PS00007 T YR_PHOS PHO_S I TE 
Tyrosine kinase phosphorylation site 

92-99 RGNEDTPY 

[5] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

157-162 GLRLGG 

[6] PDOC00009 PS00009 AMI DAT I ON 
Amidation site 

122-125 NGRK r . 

[7] PDOC00029 PS00029 LEUCINE_ZIPPER 
Leucine zipper pattern 

246-267 LAALNALLQWLLRPLVLYAPLL 

Membrane spanning structure and domains: 
Helix Begin End Score Certainity 

1 6 26 0.633 Putative 

2 237 : 257 1.571 Certain 

3 .260 280 0.819 Putative 
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BLAST Alignment to Top Hit: 

>CRA| 335001098696094 /altid=gi | 11545403 

/def=gb|AAG37824.l|AF277719_l (AF277719) 3 

beta-hydroxy-delta 5-C27-steroid oxidoreductase [Homo : ~ 
sapiens] /org=Homo sapiens /taxon=9606 /dataset=nraa 
/length=369 
Length = 369 

Score = 638 bits (1627), Expect = 0.0 ; 

Identities = 315/369 (85%) , . Positives = 316/369 (85%), Gaps = 52/369 (14%) 

Query : 1 MADSAQAQKLVYLVTGGCOTLGEHWRMLLQREPRLGELRVFDQHLGPW - : 56 

MADSAQAQKLVYLVTGGCGFLGEHWRMLLQREPRLGELRVFDQHLGPWLEELKTG 
Sbjct: 1 MADSAQAQKLVYLVTGGCGFLGEHWRMLLQREPRLGELRVFDQHLGPWLEELKTGPVRV 60 

Query: 57! TRNVIEACVQTG 68 

TRNVIEACVQTG 

Sbjct: 61 TAI QGDVTQAHEVAAAVAGAHWI HTAGLVDVFGRAS PKT I HEVNVQGTRNVI E ACVQTG 120 . 

Query: 69 TRFLVYTSSMEVVGPNTKGHPFYRGNEpT 128 

TRFLVYTSSMEWGPNTKGHPFYRGNEDTPYEAVHRHPYPCSKALAEWLVLEANGRKVRG 
Sbjct: 121 TRFLVYTS SME WGPNTKGHP FYRGNEDTP YE A VlIRHP YP CS KAXjAEWLVLEANGRKVRG 180 

Query:, 129 GLPLVTeALRPTGIYGEGHQIMRDFYRQGLRLGGWLFRAIPASVEHGRVWGOTAWMHVL 188 

' GLPLVTCALRPTGI YGEGHQIMRDFYRQGLRLGGWLFRAI PASVEHGRVWGNVAWMHVL ' 
Sbjct: 181 GLPLVTCALRPTGI YGEGHQIMRDFYRQGLRLGGWLFRAI PASVEHGRVYVGNVAWMHVL 240 

Query: 189 AARELEQRAALMGGQVYFCYDGSPYRSYEDFNMEFLGPCGLRLVGARPLLPYWLLVFLAA 248 

. AARELEQRAALMGGQVYFCYDGSP+RSYEDFNMEFLGPCGLRLVGARPLLPYWLLVFLAA 
Sbjct: 241 AARELEQRAALMGGQVYFCYDGSPHRSYEDFNMEFLGPCGLRLVGARPLLPYWLLVFLAA 300 

Query: 249 LNALLQWLLRPLVLYAPLLNPYTLAVANTTFTVSTDKAQRHFGYEPLFSWEDSRTRTILW 308 

LNALLQWLLRPLVLYAPLLNPYTLAVAN TFTVSTDKAQRHFGYEPLFSWEDSRTRTILW 
• Sb j c t : 301 LNALLQWLLRPLVLYAPLLNPYTLAVANATFTVSTDKAQRHFGYEPLFSWEDSRTRTILW 360 

Query: 309 VQAATGSAQ 317 

VQAATGSAQ , 
Sbjct: 361 VQAATGSAQ 369 (SEQ ID NO: 4) 

Hmmer search results (Pfam) : 

Scores for sequence family classification (score includes all domains): 
Model Description Score E-value N 



PF01073 3-beta hydroxys teroid dehydrogenase/isomeras 558 .2 5.5e-164 2 

PF01370 NAD dependent epimerase/dehydratase family 13.3 0.005 1 

PF00438 S-adenosylmethionine synthetase 1*8 0.78 1 

Parsed for domains : 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 

PF01370 1/1 12 31 .. 1 20 [. 13,3 0.005 

PF01073 1/2 1 52 [. 1 53 [. 69.6 3.9e-18 

PF00438 1/1 . 289 299 .. 365 .376 .] 1.8 0.78 

PF01073 2/2 55 313 . . 159 425 .] 489.6 2.4e-143 
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1 ATTTGCATTA GCCGGTGGCA GCCAACAGGT GCCTGTTTTG GAGAGAGGTC 

51 CAGGGAGGAG AGATGAGCAG GGTGCCGTTG GTGACATGGC CAGTCATTTC 

101 AGGAGCTGCC CCAACCCCAG ACTTGCCCCA GCAGTCCGGG ACCCCACTGT 

151 GACCAGGCAG ATGCTCGAAG GAGTGAGTGG CTCTCTTACC CAGTGCAGAT 

201 TTCCCTGGAG TTCCCTGCGG GTGACTTAGA ATGGCCACCA GAGGCTTAGG 

251 ATGCTGCCCC AAAGAGGGAG GGCTCCTGGA AGCAGAGTCG AGAGAGTCAG 

3 01 TGCCGGGTTA GCGGGAGCTG GAGGCAGAGC TGCAGCTCCA GGCCTGGTGG 
351 GCGTGGACCT GGGGTGCTGG CTGGCAGGCG TGCTCAGGGG CAGGAAGTGG 

4 01 GGGACTCTTC CCTGACCATC GCATCTCACC. CTGGCAGATG GTGGCCGACA 
451 TGCGGGAGAA GCGCTACGTG CAGGAGGGCA TTGGCAGCAG CTACCTGTTC 
501 CGGGTGGACC ACGACACCAT CATCGATGGC AGCAAGTGTG GCAACCTGGC 
551 CAGATTCATC AACCACTGCT GCACGGTGCG CCAGGGGCCA GCCGGGGCAG 
601 GAGTTGGGGG TCGGTGGGGG TGGCCACGGC TCACACGCCC TTCCATCCGC 
651 AGCCTAACTG CTACGCCAAG GTCATCACCA TCGAGTCCCA GAAGAAGATC 
701 GTGATCTACT CCAAGCAGCC CATTGGCGTG GACGAGGAGA TCACCTACGA 
751 CTACAAGTTC CCACTGGAAG ACAACAAGAT CCCGTGTCTG TGTGGCACAG 
801 AGAGCTGCCG GGGCTCCCTA AACTGAGGTG GGGCAGGATG GGTGCCCACA • 
851 CCCCTATTTA TTCCCCCTGG TGCCCTGAGC TCCCAGCACC CCCCCAGCCT 
901. TAGTGGGCTC AGCAGGGCCC ACATGCCCCC ATCTCCAAGC GTGGGGTTGG 
951 GGGCCCCAAG CCCAGCGAGG GAGCCTCAGT CCCTGGAGGC AGCTTCTGCC 

1001 TCTGCTGTCG CCCGTGCCCA CCACCGCCTG ATTGTTTTTC TTTGCGGAGA 
1051 AGAAGCTGTA AATGTTTTGT AGCAGCCAGC AGCTGTTTCC TGTGGAAACC 
1101 TGGGGTGCCG GCCTGTACAG ATTCTGTCCT GGGGGGCTAC ACAGTCCTCT 
1151 CGCTTTGTGT TAATGGGGAC TTCCCCTTAC GCCCTGCGTG TACCCCTCCC 
1201 CAGTTTAGGG GTCTCTGGGG GAGTGGCCAT GTTCTCCCCC TGGGGGGGCT 
1251 CTGCACCCCC AGTCCTGGGG ACTCCGTGCC TGGAACCCTG CGTCATCTGT 
1301 TCCTGCCAGA CCCTGAGGGT CACCCTTCCA CCCTGGTGTC ACTCCCCGGC 
1351 TCAGCCAGGG CAGGATGGCG GGGTGGGTCC CTTTTGCTGG GCTGGACTGT 
1401 ACATATGTTA ATAGCGCAAA CCCGACGCCA CATTTTTATA ATTGTGAT T A 
1451 AACTTTATTG TACAAAAGTG TTTGGTCGGT GTATTTGGGC AGGAGCGAGG 
1501 GGTTGGGGGT AGAGGGCACG GAGGGTTGTG CAAGTTGAAG AGAGGGAAAA 
1551 GTGGGTACCT GAAGTGTGGG GCAGGTAAAG GGGCCTTCAG GCAAGAGCCC 
1601 AGACCTGCAG AGACAGTCCG AGACTGTCTC GGACCCCCTG ACAGGCTGCA 
1651 GGAGCGGCAC CCGGACCAGG AATACGCCAC CAGTGCCCGC GAGGGTGGTG 
1701 CCAAGGTCAG GCCTCCCCTT GCTACAATCA CAGCTGCAGC TGGACCTCCG 
1751 GCCTCCTGGG AAGCCCAGCA GGAGGGAAGG CCTGAGGTCA CACTGTGGGA 
•1801 TGAGGTCACC GCTGGCTCCA CCCACAGCCC CAGACCCCTT CAGCCCACTC 
1851 TGCAAGTTCG AGCTTCATCC CCACCAAGTT CTCCGCTGGA CCCAGATGGC 
1901 AGTGGAGCAC AGAGCGGCCG CCAGGGGGCG CCTTGGGGCA AGAGTGGTGG 
1951 GGGTTGTGGC TGGGGGGGTC TCTGTTCCTG GAATGGGGCA GGAGGGAGAA 
2001 GGAGGAGCCA GCGGAAGGAC GGTGTGCGGG CCGGCCAGCC CTGGACGAAA 
2051 GAAGAGGGCC CCTCCAGGCC AGTCTGGGCA CCCTGGGATA GCGGCTGCAG 
2101 GTAGGCAGAG GCGCTGCCAG TGCCCAGGTG GCCTTTCCCT CCATCCGGCC 
2151 CTTCGCACCT TCCTATAACC TTCCCTCCAC GTCCCTCAAC TGCTGGCCTC 
2201 CCCACCCTTT TACTGCCTTG AAATCTCTCT CCCTAAACCC TGACGCCTTC 
2251 CTGCAGCCCA AGCCCGCCCC TCTCTCCGTA ACTCAGCCAT CAGCAGGGGC 

23 01 AGACGGCAGG TGGCCTGGTT GCTGCAGCTC CCAGGATCAG CTCTGCCCTG 
2351 GGGGCAAACG CCAGCCTCGT CACCGCTCCA GGGCACCTCG AGCAGTAACA 

24 01 GGTGGTTGCA GCAGGTGGGA GCCAGCCCCT GGATGAGCCA AGGTCTCTTC 
24 51 CCCAGCCAGG CATGGCCGAC TCTGCACAGG GCCAGAAGCT GGTGTACCTG 
2501 GTCACAGGGG GCTGTGGCTT CCTGGGAGAG CACGTGGTGC GAATGCTGCT 
2551 GCAGGGGGAG CCCCGGCTCG GGGAGCTGCG GGTCTTTGAC CAACACCTGG 
2601 GTCCCTGGCT GGAGGAGCTG AAGACAGGTT CTTGTTGGGG GAGCTTGTGG 
2651 TGGAGAGGGT GTGGACGCTT CCCCAACCCT TCCCAAGCTG GGATCCCCAC 
2701 CCCTGCAGTG GAACAGATGA TGCTGGTTTC TGTCCACATG GATGGGTGGA 
2751 GTGAGTC AC A TTGGGAACGT GACTCCAGGG TGGAAGATGA ACCCAGCCTC 
2801 TGGCCTCTGG CCCCAGCTCT GACATGGCCT GTGTCCTCCA ACGCCGGCCA 
2851 GGGCCTGTGA GGGTGACTGC CATCCAGGGG GACGTGACCG AGGCCCATGA 
2901 GGTGGC AGCA GCTGTGGCCG GAGCCCATGT GGTCATCCAC ACGGCTGGGC 

2 951 TGGTAGACGT GTTTGGCAGG GCCAGTCCCA AGACCATCCA TGAGGTCAAC 
3001 GTGCAGGGTG AGGAGCTCTG GACACTCCTG GCCATCTTGC CTGTTTGTTC 

3 051 CCCACTCTGT CTTTGGCCTT GACCTCCGGT GACTCGCGTG GGACAAGTTG 
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3101 TCCTATTGAC AGCCCTGCCC CCGCCTCCCC TGACCTGTCA TGGTTTTCCC 
3151 TGGACCTGGG ATGGGGAGGA GGAAGATGCA GAGAGGGAAG AAGCTGCAGC 
3201 TTGGATACGC CTCCTCCTCT GCCAGGTACC CGGAACGTGA TCGAGGCTTG 
3251 TGTGCAGACC GGAACACGGT TCCTGGTCTA CACCAGCAGC ATGGAAGTTG 
3301 TGGGGCCTAA CACCAAAGGT CACCCCTTCT ACAGGTGAGT GGCAGGCCCT 
3351 CTTGTCCTCT AAGAGCCCAT TTCCCTCAGC ATTGAGTCTT CCTTCTCCTC . 
3401 CCACCAGGGG CAACGAAGAC ACCCCATACG AAGCAGTGCA GAGGCACGCC 
34 51 TATCCTTGCA GCAAGGCCCT GGCCGAGTGG CTGGTCCTGG AGGCCAACGG 
3501 GAGGAAGGTG AGCCCAGAAA AAGGAGGCGC AGAGATGGGG CTCCTGCCCT 
3551 GCACACCCCC TTACCCTGCC AGCCCAAGGA GGCCGGGGCC GAGAGCAAGC 
3601 TGTCGGGTCC CAGGTCTCAG CAGTACCTGC CTTTGCCACC , AGGTCCGTGG 
3651 GGGGCTGCCC CTGGTGACGT GTGCCCTTCG TCCCACGGGG ATCTACGGTG 
3701 AAGGCCACCA GATCATGAGG GACTTCTACC GCCAGGGCCT GCGCCTGGGA 
3751 GGTTGGCTCT TCCGGGCCAT CCCGGCGTCT GTGGAGCATG GCCGGGTCTA 
3 801 TGTGGGTGAG GACTGGGCTA GGCAGGGGGA GGCTGAGAAT ATGGCAGGAG 
3851 GACTTGCTCT AGAAGGGGGC AGGACCCACA TGGCCCTGGG AGAGAAGTGT 
3 901 GGACTCTGGC TAGAAAAATA TGGTCTATAC ATGGGCCAAG , GTAGACTGTG 

3 951 ATTATGTCTC CACAGCCTGC AGAGAATACA GGATCCATGC AAGTTGGGAC 
4001 ATTAAAAAGT GTATCATAGG CTACAGAGAA GATTGCAGCT ATGGGAGCAG 
4051 CCATTCCCCA GGAGAGGAGA GGAGAGGGAC AGTGTGTACA CAGCACTAAA 
4101 AGGGCTGGGT TCAGTGGCTC GCATCTATAA TCCCAGCACT TTAGGAGGCT 
4151 GAGGCGGGAG GATGGCCTGA GCCCAGGAGT TGGAGGGTGC AGTGAGCTAT 
4201 GACCGCACCA CTGCACTCCA GCCTGGATGA CAGAGACAGA CCCTGTCTCT 
4251 AAAACTTTTT TTAAAGGAAG TAGCATCTAC ACAGGGAATA AGGTCACCTG 
43.01 CCACTCCATC CTGCAGTCCC CAAGCCTCTC AGGGCCCACC ACGCAGGTCC 
4351 TGGTTTCTCT ATCCTCTCCC CAGGTTCTTT GCAGATGCAG GCTGGCCCAG 
44 01 GAGAGCAAGT GACTACCAGG GCGAGGGAGA AGGCAGCCTT TCCCAGGCTG 
4451 CTGTGGGGAT GTGGGCGGCA ACTACCTGGG CCCAAAGAGG GGGTGGGCCA 
4501 GGAGAGCAGC CTCGATGTGG TGTTGCAAGG GCACTCAGGG GTGTGTCCGC 
4551 m CTCTCTTCCG CCACCGGCAG GCAATGTTGC CTGGATGCAC GTGCTGGCAG 
46 01 CCCGGGAGCT GGAGCAGCGG GCAGCCCTGA TGGGCGGCCA GGTATACTTC 
4651 TGCTACGATG GATCACCCTA CAGGAGCTAC GAGGATTTCA ACATGGAGTT 
4701 CCTGGGCCCC TGCGGACTGC GGCTGGTGGG CGCCCGCCCA TTGCTGCCCT 
4751 ACTGGCTGCT GGTGTTCCTG GCTGCCCTGA ATGCCCTGCT GGAGTGGCTG 

4 801 CTGCGGCCAC TGGTGCTCTA 1 CGCACCCCTG GTGAACCCCT ACACGCTGGC 
4851 CGTGGCCAAC ACCACCTTCA CGGTCAGCAC CGACAAGGCT CAGGGCCATT 
4901 TCGGCTATGA GCCCCTGTTC TCGTGGGAGG ATAGCCGGAG CCGCACCATT 
4951 CTCTGGGTAC . AGGCCGCTAC , GGGTTCAGCC CAGTGACGGT GGGGCTGGGG 
5001 CCTGGAGGCC CAGATACAGC ACATCCACCC AGGTGCCGAG CCCTCACACC 
5051 CTGGACGGGA AGGGACAGCT GCATTCCAGA GCAGGAGGCA GGGCTTCTGG 
5101 GGCCAGAATG GCTGTCCTTG TCGTAGAGGG CTCCACATTT TCTTTTTCTT 
5151 TTTTGAGACA GGGTCTTGCT CTGTCACCCA GACTGGAGTG GAGTGGTGTG 
5201 ATCATAGCTC AGTGCACCCT CAACCTCCTG GGTTCAAGCA ATCCTCCTGC 
5251 CTCAGCCTCC TTGAACAGCT GGGACCACAG GTGCACGCCA CCACACCTGG 

53 01 CTTTTTTTTG TTGTTTTTAG AGACAGGGTC TCACTATATT GCTCAGGCTG 
5351 GTCTTGAACT CCTGGGCTCA AGTGATCTTC CGACGTGGGC CTCCCAAAAC 

54 01 GCTGGAACTA CAAGTGTGAG CCACCGCGCC TGGCCCAAGC CCTCCACATT 
54 51 TTCAATCCAG GAGCCTTGAG TCTGTGTTGT GTCCTGACAC CTCCAAGTTC 
5501 TAGGGCGGTC AGGACACGGG AGGGTTTGGG GACAGAGTGT CCTTGCTCTG 
5551 TCCTCTCATC CCAGTCCTGA TGGCCGCTTG GTGAGTGTCT GGTGGCCTGG 
5601 TGGCCTGCCC CAGCTCTCTT CTGGCTTTCT GAGCAGGAAG CGAGCAGAGG 
5651 CTCCACAGGC TTACGCTGCT CTCCTGACAG CCACACGCGA CCCTCGGTGC 
5701 AGAGTGCAGA GGGGGCTCTG GTTCCTCCAG CCACCTCAGT CCCTCTTTGG 
5751 GAGGTGATGT TCCCATTGTT TTTCAAAGGC CTCACCTTCA ACTGTCTGTC 
5801 TTAGAATTCC CCTCTGGAGG GCTATGGCCT CCCTATGCTT TCACTTCCCA 
5851 CCTCTCTACC TAAGTTCGTT CCCAGGACAT CGCCAGCCCT GGGCCTGGGG • 
5901 ATGTCCCCAA TGCTGTACCT GGCTGACCCG GGATTAAAAG CCTCATCCAC 
5951 GACCGTGTCC ATCTGTCTGT CCAGCTCTCC CTCCCATCCC CCCACCCCAT 
6001' GTCCGCCTCC CCACGGCGCC GATCCCACGT GGGGACAGT^A GGAAGTGAGC 
6051 ACACGGCACA CCCGCTGTTG GATTGGTTGC TATTTCTCCG GTCCCACAGG 
6101 GCCTGACCTG GCCCAGGGTG GGGTGGGGGG CTCTGGGGAC AGGAC ATGC A 
6151 GGGAGGAAGG GGGGGGCAGG ATTTTCCTGT GTTTTATCCA TTTGCAAGTT 
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6201 GGTCACCAAT 
6251 GGGCTAACAG 
6301 AGTCCTGTCC 
6351 AGAAACAGAG 
6401 CGTGACACGC 
6451 GCGGGGAGGG 
6501 TCTGCAGGGC 
6551 TACACGCTCA 
6601 ATGGACACGC 
6651 GAGGAAGCAC 
6701 TGGGGCACAT 
6751 ATTTGTCCAA 
6 8 0.1,, ACGCACACAG 
6851 CGCATAGAAA 
6901 GCCCAGAGAT 
6951 GGTGCCCCAG 
7001 ACTTCCGGAA 
7051 GAGGGGACTG 
7101 ATGGGAGCAG 
7151 AGAGGGGCGT 
7201 ACCTGGGGTT 
7251 AGCTCGGCTG 
7301 A . 
(SEQ ID NO: 3) 



AGAAATGGGA 
GAGAGGGCGG 
CCTCTGAGAA 
AAGGGGGCAG 
CAGTGACACA 
GCGGTTAGGG 
GGGCTGGGGG 
GGGCGTGGCA 
AGTGTGCTTC 
TGAGCCCTGG 
GTGGGCTGAA 
GAACATGCAA 
ACAGCAGCCA 
GGGCACAGCA 
GGGCAGCTGT 
CCACGGTTCA 
ACACCTGGGC 
GGGGCTGGGG 
GAGGGCCGTG 
TCTGGGAAGG 
ACGCACGTGG 
TCTGAGCTCG 



CTCTGAGGGC 
CCTGGCTCTG 
GGGGAGGGAG 
GGGCTCATGT 
GAAACACACG 
GAAAGGGAGC 
GCAGGGTGAA 
AGAGCGTGCG 
ATGAGGGGTG 
CCAGGCCCGG 
TGGTTGCAGG 
AAGACACCAG 
ACAAGCAGGC 
GACCCTTAGA 
GGGCCCAATG 
ACCCTTCTCC 
TGAATGGGGG 
CCTGCTCTGA 
GCCAGACTTG 
GACAGGCAGG 
CCACACTGAC 
GGTAGAGGTG 



TAACAGAAAT 
GGCCCCAGCC 
AGAGCTCTAG 
CAGCAAACAC 
CCAACGCACA 
CGGGGCCACC 
TGCATAGAAC 
TCGACCCACG 
GGAACAGGGA 
GACCACCCGG 
CGCCTGGGCA 
CCTCCAGACA 
ACATCATAGG 
GATCCGCTGG 
CCACTCCAGG 
TGTGGCCCCA 
TCCTGTCCAG 
TGTCTCCCAA 
GGGCAGACTT 
CCCCCAGCTC 
ACACACACAG 
GAGGGGTAGT 



GGGACTCTGA 
AGGCCCCAGG 
AAACCAACGG 
GGCTACATCA 
CGGCTGCACA. 
CATCTTGTCC 
ACATCATGTG 
GGTACATGGG 
GGAGGGGGAA 
AGGGCACACG 
GGTAGCACAC 
AC ATGCC AGG 
ATGTGGAGGA 
TCCACCTGAG 
TGGGGGGAGT 
AGGCCGTGGG 
GCGGCCGGAA 
GCAGCC GGAG 
CCTGTGCTGC 
AGGACAGCCC 
GACAAGGGAG 
GTGTTCTGGG 



FEATURES : 

Start : 

Exon : 

Intron: 

Exon: 

Intron: 

Exon : 

Intron: 

Exon: 

Intron: 

Exon: 

Stop 



2462 

2462- 

2628- 

3226- 

3335- 

3408- 

3508- 

3643- 

3806- 

4571- 

4984 



2627 
3225 
3334 
3407 
3507 
3642 
3805 
4570 
4983 



SNPS: 



DNA 

Position Major 



Minor 



Domain 



Protein 

Position Major 



Minor 



1010 
1151 
4805 
6293 



A 
C 
G 
G 



G 
T 
A 
A 



Beyond ORF(5') 
Beyond ORF(5') 
ExOn .." 
Beyond ORF(3' ) 



258 



Context : 

DNA . . 

Position 



1010 tccaagcagcgcattggcgtggaggaggagatcacctacgactAcaagttcccactggaa 
gacaacaagatcccgtgtctgtgtggcacagagagctgccggggctccctaaactgaggt 
ggggcaggatgggtgcccacacccctatttattccccctggtgccctgagctcccagcac 
. ccccccagccttagtgggctcagcagggcccacatgccgccatctccaagcgtggggttg 
ggggccccaagcccagcgagggagcctcagtccctggagggagcttctgcctctcctgtc 

[A; G] 

cccctgcccaccacgccctgattgtttttctttgcggagaagaagctgtaaatgttttgt 
agcagccagcagctgtttcctgtggaaacctggggtgccggcctgtacagattctgtcct 
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GGGGGGCTACACAGTGCTCTGGCTTTGTGTTAATGGGGACTTGCCCTTACGCCCTGCGTG 
TACCCCTCGCCAGTTTAGGGGTCTCTGGGGCAGTGGCCATGTTCTCCCCCTGGGGGGGCT 
CTGCAeceCCAGTCCTGGGGACTCCGTGGCTGGAACCCTGCCTCATCTGTTCCTGCCAGA 

CCCCTATTTATTGCCCCTGGTGCCCTGAGCTCCCAGCACCCCCCCAGCCTTAGTGGGCTC 
AGCAGGGCCCACATGCCCCCATCTCCAAGCGTGGGGTTGGGGGCCCCAAGCCCAGCGAGG 
GAGCCTCAGTCCCTGGAGGCAGCTTCTGCCTCTCCTGTCGCCCCTGCCCACCACCGCCTG 
ATTGTTTTTCTTTGCGGAGAAGAAGCTGTAAATGTTTTGTAGCAGCCAGCAGCTGTTTCC 
TGTGGAAACGTGGGGTGCCGGCCTGTACAGATTCTGTCCTGGGGGGCTACAGAGTCCTCT 
[C,TJ 

GCTTTGTGTTAATGGGGACTTCCCCTTACGCCCTGCGTGTACCCCTCCCGAGTTTAGGGG 
TCTCTGGGGCAGTGGCCATGTTGTCCCCCTGGGGGGGCTGTGGACCCCCAGTCCTGGGGA 
CTCCGTGCCTGGAACCCTGCCTCATCTGTTCCTGCCAGACCCTGAGGGTCACCCTTCCAC 
CCTGGTGTCACTCCCCGGGTCAGCCAGGCCAGGATGGCGGGGTGGGTCCCTTTTGCTGGG 
CTGGACTGTACATATGTTAATAGCGC^^ACCCGACGCGACATTTTTATAATTGTGATTAA 

r " '. 

AGCAGCCTCGATGTGGTGTTGCAAGGGCACTCAGGGGTGTGTCCGCCTCTCTTCCGCCAC 
CGGCAGGCAATGTTGCCTGGATGCACGTGGTGGGAGCGCGGGAGCTGGAGCAGGGGGCAG 
CCGTGATGGGCGGCCAGGTATACTTCTGCTACGATGGATCACGCTACAGGAGCTACGAGG 
ATTTCAACATGGAGTTCCTGGGCCCCTGCGGACTGCGGCTGGTGGGCGCCCGCCCATTGC 
TGCCCTACTGGCTGCTGGTGTTCCTGGCTGCCCTCAATGCCCTGGTGCAGTGGCTGCTGC 
[G, A] 'V- 

GCCACTGGTGCTCTACGCACCCCTGCTGAAGCCCTACACGCTGGCCGTGGCCAACACCAC 
CTTCACCGTCAGCACCGACAAGGCTCAGCGCCATTTGGGCTATGAGCGCCTGTTCTCGTG 
GGAGGATAGCCGGACCCGCACCATTCTCTGGGTACAGGCCGCTACGGGTTCAGCCCAGTG 
ACGGTGGGGCTGGGGCCTGGAGGCCCAGATACAGCACATCCACCCAGGTCCCGAGCCCTC 
ACACCCTGGAGGGGAAGGGAGAGCTGCATTCCAGAGCAGGAGGCAGGGCTTCTGGGGCCA 

"6 293 CACCCCATGTCCGCCTCCCCACGGCGCCCATCCCACGTGGGGAGAGAAGGAAGTGA.GGAC 
ACGGCACACCCGCTGTTGGATTGGTTGCTATTTCTCCCGTCCCACAGGGCCTGACCTGGC 
CCAGGGTGGGGTGGGGGGCTCTGGGGACAGGACATGCAGGGAGGAAGGGGGGGGCAGGAT 
TTTCCTGTGTTTTATCCATTTGCAAGTTGGTCACCAATAGAAATGGGACTCTGAGGGCTA 
ACAGAAATGGGAGTCTGAGGGGTAACAGGAGAGGGCGGCCTGGCTCTGGGCCCCAGCCAG 

■ [G, A] ; ; • 

CCCCAGGAGTCCTGTCCCCTCTGAGAAGGGGAGGGAGAGAGCTCTAGAAACCAACGGAGA 
AACAGAGAAGGGGGCAGGGGCTCATGTCAGCAAACACGGCTACATCACGTGACACGCCAG 
TGACACAGAAACACACGCCAACGCACACGGCTGCACAGCGGGCAGGGGCGGTTAGGGGAA 
AGGGAGCCGGGGCCAGCCATCTTGTCCTCTGCAGGGCGGGCTGGGGGGCAGGGTGAATGC 
ATAGAACACATCATGTGTACACGCTCAGGGCGTGGCAAGAGCGTGCGTCGACCCACGGGT 

Chromosome map: 

Chromosome 12 



1151 



4805 



FIGURE 3D 



